Large-Scale Speaker Ranking from Crowdsourced Pairwise Listener Ratings

نویسنده

  • Timo Baumann
چکیده

Speech quality and likability is a multi-faceted phenomenon consisting of a combination of perceptory features that cannot easily be computed nor weighed automatically. Yet, it is often easy to decide which of two voices one likes better, even though it would be hard to describe why, or to name the underlying basic perceptory features. Although likability is inherently subjective and individual preferences differ frequently, generalizations are useful and there is often a broad intersubjective consensus about whether one speaker is more likable than another. However, breaking down likability rankings into pairwise comparisons leads to a quadratic explosion of rating pairs. We present a methodology and software to efficiently create a likability ranking for many speakers from crowdsourced pairwise likability ratings. We collected pairwise likability ratings for many (>220) speakers from many raters (>160) and turn these ratings into one likability ranking. We investigate the resulting speaker ranking stability under different conditions: limiting the number of ratings and the dependence on rater and speaker characteristics. We also analyze the ranking wrt. acoustic correlates to find out what factors influence likability. We publish our ranking and the underlying ratings in order to facilitate further research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Intelligibility of A

A study of speaker intelligibility involving 45 speakers from a homogeneous accent group was carried out with adult, 7-8 year old and 11-12 year old listeners. Ranking of speakers according to their intelligibility was remarkably consistent across listener groups, thus demonstrating that inherent speaker characteristics are the primary contributor to intelligibility. In a second study, listener...

متن کامل

منطق گفتگو و غزل عرفانی

The logic of conversation is based on the speaker (I) , the hearer (you) and the referrent (He). It can take three forms: 1. the speaker talks to the listener about the referrent 2. The speaker talk about the listener to the listener 3. The speaker talks about himself to the listener. The present Paper elaborates on these issues and explains how this logic takes shape in mystical lyrics. The mo...

متن کامل

ARating-RankingMethod for Crowdsourced Top-k Computation

Crowdsourced top-k computation aims to utilize the human ability to identify top-k objects from a given set of objects. Most of existing studies employ a pairwise comparison based method, which first asks workers to compare each pair of objects and then infers the top-k results based on the pairwise comparison results. Obviously, it is quadratic to compare every object pair and these methods in...

متن کامل

Listener attitudes toward individuals with cerebral palsy who use speech supplementation strategies.

This study examined listener attitudes toward 7 speakers with dysarthria who implemented 3 speech supplementation strategies (topic cues, alphabet cues, and combined topic and alphabet cues) and a habitual speech control condition. Findings were similar, but not identical, to intelligibility results published in 2 earlier papers (K. C. Hustad, J. Auker, N. Natale, and R. Carlson, 2003; K. C. Hu...

متن کامل

Modeling Emotion Expression and Perception Behavior in Auditive Emotion Evaluation

In this paper, we consider both speaker dependent and listener dependent aspects in the assessment of emotions in speech. We model the speaker dependencies in emotional speech production by two parameters which describe the individual’s emotional expression behavior. Similarly, we model the listener’s emotion perception behavior by a simple parametric model. These models form a basis for improv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017